BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
